POLYNOMIAL CONFIGURATIONS IN THE PRIMES
Abstract. The Bergelson-Leibman theorem states that if $P_1, \dots, P_k \in \mathbb{Z}[x]$, then any
subset of the integers of positive upper density contains a polynomial configuration
$x + P_1(m), \dots, x + P_k(m)$, where $x, m \in \mathbb{Z}$. Various generalizations of this theorem are known.
Wooley and Ziegler showed that the variable $m$ can in fact be taken to be a prime minus
1, and Tao and Ziegler showed that the Bergelson-Leibman theorem holds for subsets of the
primes of positive relative upper density. Here we prove a hybrid of the latter two results,
namely that the step $m$ in the Tao-Ziegler theorem can be restricted to the set of primes
minus 1.



1. Introduction 

Roughly twenty years after the ergodic theoretic proof of Szemerédi's theorem on long
arithmetic progressions in dense subsets of the integers by Furstenberg [4], Bergelson and
Leibman [1] proved the following celebrated polynomial generalization.

Theorem 1 (Bergelson-Leibman). Let $P_1, \dots, P_k$ be polynomials in $\mathbb{Z}[x]$ such that $P_i(0) = 0$
for $i = 1, \dots, k$. Then any subset of the integers of positive relative upper density contains a
configuration of the form $a + P_1(d), \dots, a + P_k(d)$, where $a, d$ are integers, $d \neq 0$.

More recently, Tao and Ziegler [11] proved Theorem 1 for dense subsets of the primes, using
the general transference strategy of Green and Tao [5].

Theorem 2 (Tao-Ziegler). Let $P_1, \dots, P_k$ be polynomials in $\mathbb{Z}[x]$ such that $P_i(0) = 0$ for
$i = 1, \dots, k$. Then any subset of the primes of positive relative upper density contains a
configuration of the form $a + P_1(d), \dots, a + P_k(d)$, where $a, d$ are integers, $d \neq 0$.

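Here for any subset $A$ of the set of primes $\mathcal{P}$, the relative upper density $\overline{d}_{\mathcal{P}}(A)$ of $A$ in $\mathcal{P}$
is defined as

\[ \overline{d}_{\mathcal{P}}(A) = \limsup_{N \to \infty} \frac{|A \cap [N]|}{|\mathcal{P} \cap [N]|}. \]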
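
The ratio inside this limit is easy to explore numerically. The following is a minimal sketch, not part of the original argument: the helper names `primes_up_to` and `relative_density` are hypothetical, and the sample set $A$ (primes congruent to 1 mod 4) is an illustrative assumption.

```python
from math import isqrt

def primes_up_to(n):
    """Sieve of Eratosthenes: return the list of primes <= n."""
    if n < 2:
        return []
    sieve = bytearray([1]) * (n + 1)
    sieve[0] = sieve[1] = 0
    for p in range(2, isqrt(n) + 1):
        if sieve[p]:
            # Cross out the multiples of p starting from p*p.
            sieve[p * p :: p] = bytearray(len(range(p * p, n + 1, p)))
    return [i for i in range(n + 1) if sieve[i]]

def relative_density(in_A, n):
    """Return |A ∩ [n]| / |P ∩ [n]| for a set A of primes given by the predicate in_A."""
    primes = primes_up_to(n)
    return sum(1 for p in primes if in_A(p)) / len(primes)

# Illustrative choice of A (an assumption, not from the paper): primes = 1 mod 4.
for n in (10**3, 10**4, 10**5):
    print(n, round(relative_density(lambda p: p % 4 == 1, n), 4))
```

The relative upper density is the limit superior of these ratios, so a finite computation only suggests its value.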

In a recent preprint, Wooley and Ziegler [12] showed that the step $d$ of the polynomial
progression in Theorem 1 can be taken to be a shifted prime.

Theorem 3 (Wooley-Ziegler). Let $P_1, \dots, P_k$ be polynomials in $\mathbb{Z}[x]$ such that $P_i(0) = 0$ for
$i = 1, \dots, k$. Then any subset of the integers of positive relative upper density contains a
configuration of the form $a + P_1(p - 1), \dots, a + P_k(p - 1)$, where $a$ is an integer and $p$ is
prime. The same is true if we replace $p - 1$ with $p + 1$.

A generalization to polynomials in several variables, with a simpler proof, was subsequently
obtained by Frantzikinakis, Host and Kra [3].

Our goal in this paper is to establish the following hybrid of Theorems 2 and 3.

Theorem 4. Let $P_1, \dots, P_k$ be polynomials in $\mathbb{Z}[x]$ such that $P_i(0) = 0$ for $i = 1, \dots, k$. Then
any subset of the primes of positive relative upper density contains a configuration of the form
$a + P_1(p - 1), \dots, a + P_k(p - 1)$, where $a$ is an integer and $p$ is prime. The same is true if we
replace $p - 1$ with $p + 1$.
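
For small parameters, configurations of this shape can be found by brute force. The sketch below is only an illustration: the function names are hypothetical, the set $A$ is taken to be all primes, and the system $0, x, x^2$ is an assumed example of polynomials vanishing at 0.

```python
def is_prime(n):
    """Trial division; adequate for the small ranges used here."""
    if n < 2:
        return False
    if n % 2 == 0:
        return n == 2
    d = 3
    while d * d <= n:
        if n % d == 0:
            return False
        d += 2
    return True

def shifted_prime_configurations(polys, a_max, p_max, shift=-1):
    """Yield (a, p) with p prime such that a + P(p + shift) is prime for every P in polys."""
    for p in range(2, p_max + 1):
        if not is_prime(p):
            continue
        d = p + shift  # the step is a shifted prime: p - 1 by default, p + 1 if shift=1
        for a in range(1, a_max + 1):
            if all(is_prime(a + P(d)) for P in polys):
                yield a, p

# Assumed sample system: P_1 = 0, P_2 = x, P_3 = x^2 (each vanishes at 0).
polys = [lambda d: 0, lambda d: d, lambda d: d * d]
found = list(shifted_prime_configurations(polys, a_max=50, p_max=20))
print(found[:5])
```

A search like this says nothing about subsets of positive relative density; it only shows what the pairs $(a, p)$ in the statement look like.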

In other words, we claim that the step $d$ in Theorem 2 can be restricted to be of the form
$p - 1$ (or $p + 1$). In fact, our proof shows that there are infinitely many such configurations,
and we are able to give a lower bound on their number which is of the order of magnitude
predicted by the Bateman-Horn conjecture. Previously, the question about the existence of
such configurations has also been posed as Conjecture 1.2 in [9].

Our method is very similar to that employed in [3], in the sense that we compare an average
over the integers to an average along the shifted primes using multiple applications of van
der Corput's lemma and a PET induction scheme. However, we proceed quantitatively in the
spirit of [11] and rely on a refined analysis of the correlation properties of the pseudorandom
measure from that paper.

The rest of the article is structured as follows. In Section 2 we set up our notation, and
in Section 3 we reduce Theorem 4 to the more technical Propositions 1 and 2. We study a
simple example of Proposition 1 in Section 4 and follow it up in Section 5 with a discussion of
a modified polynomial forms condition that arises from the example, together with an outline
of the proof of Proposition 2. The technical details of the proof of Proposition 2 can be found
in an appendix. Finally, the general case of Proposition 1 is proved in Section 6 using the
now standard PET induction scheme.

Acknowledgements. Work on this project began during the first author's visit to École
polytechnique, and he would like to thank the Centre de Mathématiques Laurent Schwartz
for its hospitality. The authors would also like to thank Terence Tao for helpful discussions.

2. Preliminaries 

We assume some familiarity with the work of Green and Tao [5] and Tao and Ziegler [11],
as well as with the definition and basic properties of the Gowers uniformity norms. Here
we only briefly remind the reader of the most important definitions, lemmas and parameter
settings from those papers. The experienced reader is encouraged to skip this section and
consult it later as the need arises.

Landau's $O$, $o$ and Vinogradov's $\ll$ notation are given their usual asymptotic meaning.
That is, for two quantities $X, Y$, we write $X \ll Y$, $Y \gg X$, or $X = O(Y)$ if we have a bound
$|X| \leq CY$ for some constant $C$. If $C$ depends on other parameters such as $k$, then this
dependence is indicated as $X \ll_k Y$, $Y \gg_k X$, or $X = O_k(Y)$. By $o(1)$ we denote a quantity
that goes to $0$ as $N \to \infty$. If this quantity depends on other parameters such as $k$, then this
dependence is sometimes indicated as $o_k(1)$.

Throughout the paper, we fix a system of polynomials $P_1, \dots, P_k$ with integer coefficients,
each vanishing at 0. We can certainly assume that these polynomials are distinct. We also
fix a subset $A \subseteq \mathcal{P}$ satisfying $\overline{d}_{\mathcal{P}}(A) = \delta_0 > 0$. All implicit constants are allowed to depend
on $\delta_0, P_1, \dots, P_k$.

To get around the fact that the primes are not equidistributed with respect to small moduli,
we let $w \ll \log\log\log N$ be any sufficiently slowly growing function in $N$, and let $W = \prod_{p < w} p$
be the product of the primes less than $w$, so that $W \ll \log\log N$. Eventually, just as in [5]
and [11], we will be able to take $w$ to be a sufficiently large constant, see the discussion in Section

m 

It follows from the assumption on the density of $A$ that there is an infinite sequence of
integers $N'$ going to infinity such that

\[ |A \cap [N']| > \frac{1}{2}\,\delta_0\, \frac{N'}{\log N'}. \]

We set $N = \lfloor N'/2W \rfloor$, and observe that the asymptotic limit as $N \to \infty$ is equivalent to
the asymptotic limit as $N' \to \infty$. By the pigeonhole principle, we can find $b = b(N) \in [W]$
coprime to $W$ such that

\[ |\{x \in [N/2] : Wx + b \in A\}| \gg \frac{W}{\phi(W)} \frac{N}{\log N}, \]

where $\phi$ is Euler's totient function.
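
The choice of $W$, $N$ and the residue class $b$ can be illustrated on a toy scale. The sketch below is a hedged illustration only: the function `w_trick` is a hypothetical helper, $w$ is a fixed small number rather than a slowly growing function of $N$, and $A$ is taken to be the set of all primes up to $N'$.

```python
from math import gcd

def is_prime(n):
    """Trial division; adequate for the small ranges used here."""
    if n < 2:
        return False
    if n % 2 == 0:
        return n == 2
    d = 3
    while d * d <= n:
        if n % d == 0:
            return False
        d += 2
    return True

def w_trick(A, w, n_prime):
    """Return (W, N, b, counts): W is the product of the primes below w,
    N = floor(n_prime / (2W)), counts[b] = #{x in [N/2] : W*x + b in A}
    for each b in [W] coprime to W, and b is a class with the largest count."""
    W = 1
    for p in range(2, w):
        if is_prime(p):
            W *= p
    N = n_prime // (2 * W)
    counts = {
        b: sum(1 for x in range(1, N // 2 + 1) if W * x + b in A)
        for b in range(1, W + 1)
        if gcd(b, W) == 1
    }
    b = max(counts, key=counts.get)  # pigeonhole: the best class is at least average
    return W, N, b, counts

n_prime = 3000
A = {p for p in range(2, n_prime + 1) if is_prime(p)}  # assumed sample set: all primes <= N'
W, N, b, counts = w_trick(A, w=6, n_prime=n_prime)
print(W, N, b, counts[b])
```

Only classes coprime to $W$ are examined, since any other class contains at most one prime.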

The expression $\mathbb{E}_{y \in Y} f(y)$ denotes the average of a function $f$ over a finite set $Y$. Borrowing
notation from ergodic theory, we also write $\int_X f$ for $\mathbb{E}_{x \in X} f(x)$ and $Tf(x) = f(x - 1)$. For
convenience we set $X$ equal to the cyclic group $\mathbb{Z}_N$. The fact that the elements $x$ we consider
are restricted to lie in the interval $[N/2]$ ensures that there is no problem with wrap-around
in $X$.

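For a modulus $W$ and a residue $1 \leq b < W$ coprime to $W$, let us define

\[ \Lambda_{W,b;N}(n) = \begin{cases} \frac{\phi(W)}{W}\log(Wn + b), & \text{if } Wn + b \text{ is prime and } 1 \leq n \leq N, \\ 0 & \text{otherwise.} \end{cases} \tag{1} \]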



For the purposes of this paper, a measure is a non-negative function $\nu : X \to [0, \infty)$ satisfying
$\int_X \nu = 1 + o(1)$ and the pointwise bound $\nu = O_\varepsilon(N^\varepsilon)$ for any $\varepsilon > 0$. The measures we will be
working with are of the form

\[ \nu_{W,b}(n) = \frac{\phi(W)}{W} \log R \left( \sum_{m \mid Wn+b} \mu(m)\, \chi\!\left( \frac{\log m}{\log R} \right) \right)^2, \tag{2} \]

where $\mu$ is the Möbius function and $\chi$ is an even smooth function supported on $[-1, 1]$ satisfying



In [11], Tao and Ziegler defined a pseudorandom measure to be a measure satisfying two
technical conditions known as the polynomial forms condition and the polynomial correlation
condition, and they showed that $\nu_{W,b}$ as defined in (2) satisfies both of these. We refer the
reader to the precise definitions of the polynomial forms and the polynomial correlation
condition in [11, Definitions 3.6 and 3.9]. In this paper, we will need a variant of the polynomial
forms condition for pairs of pseudorandom measures, which we call the extra condition. It
will be given in Section 5, where we also verify that this extra condition is satisfied by a pair
$\nu_{W,b_1}, \nu_{W,b_2}$ for potentially distinct $b_1, b_2$.

Let us list the important remaining parameters.

• Let $d_0 = \max_{1 \leq i \leq k} \deg P_i$ denote the maximal degree of the polynomials.

• Let $M = N^{\eta_0}$ be the "coarse scale", which serves as a bound for the step of the
polynomial progression. We can take $\eta_0$ to be any positive number less than $1/2d_0$.

• Let $0 < \eta_1 \ll \eta_0$ be a tiny parameter, depending on $P_1, \dots, P_k$, which controls the
degree of pseudorandomness of a measure $\nu$.

• Let $0 < \eta_2 \ll \eta_1/d_0$, and $R = N^{\eta_2}$ be the sieve level which is used in the construction
of $\nu$.

We do not explicitly specify the parameters $\eta_1, \eta_2$, but insist that they depend only on the
system $P_1, \dots, P_k$ and are chosen sufficiently small to accommodate all our estimates (notably
those arising from the PET induction). Note that Tao and Ziegler also needed the "fine scale"
$H = N^{\eta_7}$, but we shall not need it here. In this sense our work is much simpler than [11].

For completeness, we state two basic and well-known inequalities we shall use repeatedly.

Lemma 1 (Cauchy-Schwarz). Let $A, B$ be sets, let $f, F$ be functions on $A$ and let $g$ be a
function on $A \times B$. If $|f| \leq F$ pointwise, then

\[ \left| \mathbb{E}_{a \in A, b \in B} f(a) g(a,b) \right|^2 \leq \mathbb{E}_{a \in A} F(a) \; \mathbb{E}_{a \in A} F(a) \left| \mathbb{E}_{b \in B} g(a,b) \right|^2. \]

Lemma 2 (van der Corput). Let $(x_m)_{m \in \mathbb{Z}}$ be a real-valued sequence satisfying $x_m = 0$ outside
the interval $[M]$. Then

\[ \left| \mathbb{E}_{m \in [M]} x_m \right|^2 \ll \mathbb{E}_{|h| < M} \mathbb{E}_{m \in [M]} x_m x_{m+h}. \]

This lemma follows by simply expanding out the square, and is reminiscent of [11, Lemma
A.1]. Note that, in contrast with [11, Lemma A.1] where $m$ and $h$ are on different scales, in
our situation $h$ and $m$ are on the same scale. This fact is important for us since it will make
the Gowers norms appear.
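
The expansion behind Lemma 2 can be checked exactly on small examples. The sketch below rests on one assumption that the text does not spell out: $\mathbb{E}_{|h| < M}$ is read as the average over the $2M - 1$ integers $h$ with $|h| < M$. Under that reading, expanding the square gives an identity with the explicit constant $(2M - 1)/M < 2$. The function name `vdc_sides` is hypothetical.

```python
from fractions import Fraction
import random

def vdc_sides(x):
    """x lists the values x_1, ..., x_M of a sequence vanishing outside [M].
    Return (lhs, rhs) with lhs = |E_{m in [M]} x_m|^2 and
    rhs = E_{|h| < M} E_{m in [M]} x_m x_{m+h}, both as exact fractions."""
    M = len(x)
    get = lambda m: x[m - 1] if 1 <= m <= M else 0
    lhs = Fraction(sum(x), M) ** 2
    total = sum(get(m) * get(m + h) for h in range(-M + 1, M) for m in range(1, M + 1))
    rhs = Fraction(total, (2 * M - 1) * M)
    return lhs, rhs

random.seed(0)
x = [random.randint(-5, 5) for _ in range(12)]
lhs, rhs = vdc_sides(x)
M = len(x)
# Expanding the square: every pair (m, m') in [M]^2 has m' - m = h with |h| < M.
print(lhs == Fraction(2 * M - 1, M) * rhs, lhs <= 2 * rhs)
```

Exact rational arithmetic is used so that the identity can be tested with equality rather than a floating-point tolerance.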
3. Overview of the proof

The main result of Tao and Ziegler is the following [11, Theorem 2.3]. We shall use it as a
black box in the sequel, although we will need to delve into the details of the proof in a later
part of the argument.

Theorem 5 (Uniform polynomial Szemerédi theorem in the primes). Let $\nu$ be a pseudorandom
measure on $X$. If $f$ is a function on $X$ such that $0 \leq f \leq \nu$, $\int_X f \geq \delta$, then

\[ \mathbb{E}_{m \in [M]} \int_X T^{P_1(Wm)/W} f \cdots T^{P_k(Wm)/W} f \geq c(\delta) - o(1) \]

for some constant $c(\delta) > 0$ depending on $\delta$.



Just as in [3], we will also need the following deep result from Green and Tao's programme
of counting linear patterns in primes (see [6, 7, 8], but also [3, Theorem 2.2]).

Theorem 6 (Green-Tao, Green-Tao-Ziegler). For every $d \in \mathbb{N}$ we have

\[ \max_{1 \leq b < W,\ (b,W)=1} \left\| \Lambda_{W,b;N} - 1_{[N]} \right\|_{U^d(\mathbb{Z}_{(2d+1)N})} = o(1). \]

Our main result will be deduced from two statements, the first of which is analogous to [3,
Lemma 3.5]. The "extra condition" mentioned in the hypotheses of Proposition 1 below is
quite technical, and will be defined in Section 5 (Definition 2).

Proposition 1. Let $\nu_1, \nu_2$ be a pair of pseudorandom measures on $X$ satisfying the extra
condition. Suppose that $f_1, \dots, f_k$ are functions on $X$ with $|f_i| \leq \nu_1$ for $i = 1, \dots, k$, and that
$a$ is a weight on $X$ with support in $[M]$ such that $|a| \leq 1 + \nu_2$. Then

\[ \mathbb{E}_{m \in [M]} \int_X a(m)\, T^{P_1(Wm)/W} f_1 \cdots T^{P_k(Wm)/W} f_k = O\!\left( \|a\|_{U^d(\mathbb{Z}_{(2d+1)M})} \right) + o(1), \]

where $d$ is an integer depending only on the system of polynomials.

We shall also show that a pair of measures satisfying the hypotheses of Proposition 1
actually exists.

Proposition 2. For any $b \neq 0$ coprime to $W$, the pair $\nu_1 = \nu_{W,b}$, $\nu_2 = \nu_{W,1}$ of pseudorandom
measures satisfies the extra condition.

Remark 3.1. If we were only interested in configurations inside the full set of primes (rather
than subsets of positive relative density), this proposition and the needed extra condition
would be slightly simpler to state, and to prove. However, we need to be able to take
potentially distinct residue classes for the pseudorandom measures governing $a$ and
the $f_i$s since the residue class $b$ mod $W$ on which the set $A$ is dense was chosen by the
pigeonhole principle.

To conclude this section, we show how Theorem 4 follows from Propositions 1 and 2.

Proof of Theorem 4 assuming Propositions 1 and 2. Suppose that we are given a subset
$A \subseteq \mathcal{P}$ of relative upper density $\delta_0$. We shall let $f_1 = \cdots = f_k = f$, where

\[ f(x) = \begin{cases} \frac{\phi(W)}{W}\log R & \text{if } R < x < N/2 \text{ and } Wx + b \in A, \\ 0 & \text{otherwise.} \end{cases} \tag{3} \]



As remarked in Section 2, by the pigeonhole principle we can choose $b$ such that $\int_X f \gg \delta_0$
provided that $N$ is sufficiently large. Set $\nu_1 = \nu_{W,b}$ so that $0 \leq f \leq \nu_1$. Let

\[ g(x) = \begin{cases} \frac{\phi(W)}{W}\log(Wx + 1) & \text{if } R < x \leq M \text{ and } Wx + 1 \text{ is prime,} \\ 0 & \text{otherwise} \end{cases} \]

(in other words, $g$ is the same function as $\Lambda_{W,1;M}$ except on $[R]$). Then there is a constant $\alpha$
such that $0 \leq \alpha g \leq \nu_2 = \nu_{W,1}$.

Set $a = \alpha(g - 1_{[M]})$, so that $|a| \leq 1 + \nu_2$. Proposition 2 states that the pair $\nu_1, \nu_2$ satisfies
the extra condition.

Applying Proposition 1 with these choices yields

\[ \mathbb{E}_{m \in [M]} \int_X a(m)\, T^{P_1(Wm)/W} f \cdots T^{P_k(Wm)/W} f = O\!\left( \|g - 1_{[M]}\|_{U^d(\mathbb{Z}_{(2d+1)M})} \right) + o(1). \]

Since $g$ and $\Lambda_{W,1;M}$ differ on a negligible subset of $\mathbb{Z}_{(2d+1)M}$, Theorem 6 tells us that the
right-hand side can be made arbitrarily small if $N$ is sufficiently large. By Theorem 5 we also
have

\[ \mathbb{E}_{m \in [M]} \int_X T^{P_1(Wm)/W} f \cdots T^{P_k(Wm)/W} f \geq c(\delta_0) - o(1). \]

Since $P_i(Wm)/W$ is much less than $N/2$ for $m \in [M]$ and the progressions therefore cannot wrap
around the group $\mathbb{Z}_N$, we can replace the average over $X$ with the average over $[N]$ and find
that

\[ \mathbb{E}_{m \in [M]} \mathbb{E}_{x \in [N]}\, g(m) f(x + P_1(Wm)/W) \cdots f(x + P_k(Wm)/W) \geq c(\delta_0) - o(1). \tag{4} \]

It remains to replace $g$ by a suitable indicator function for the primes congruent to 1 mod $W$.
Since we are only looking for a lower bound, this is straightforward to accomplish. Indeed,
we see that the left-hand side of (4) is bounded above by

\[ \frac{1}{MN} \cdot \frac{\phi(W)}{W} \log(WM + 1) \left( \frac{\phi(W)}{W} \log R \right)^{k} \]

times the number of pairs $(m, x) \in [M] \times [N]$ such that $Wm + 1 \in \mathcal{P}$, $x + P_i(Wm)/W \in [N/2]$
and $Wx + b + P_i(Wm) \in A$ for all $i = 1, \dots, k$. This is equivalent to saying that for sufficiently
large $N$, the number of pairs $(p, x) \in [WM + 1] \times [N]$ satisfying $p \in \mathcal{P}$, $p \equiv 1 \pmod{W}$ such that
$x + P_i(p - 1)/W \in [N/2]$ and $Wx + b + P_i(p - 1) \in A$ for all $i = 1, \dots, k$ is at least a constant multiple of

\[ MN \left( \frac{W}{\phi(W) \log N} \right)^{k+1}. \]

But the right-hand side tends to infinity with $N$, concluding the proof of Theorem 4. □

It thus suffices to prove Propositions 1 and 2.

4. A TOY EXAMPLE 

In this section we will study the toy example of the configuration $x, x + (p - 1)^2$, and use it
to motivate the definition of the extra condition in the subsequent section. (Note, however,
that the existence of this particular configuration in the primes already follows from the work
of Li and Pan [9]. A more general result was recently proved by Rice [10].) For simplicity we
assume here that $W = 1$.

Let $\nu_1, \nu_2$ be a pair of pseudorandom measures. Suppose that $f_0, f_1$ are positive functions
satisfying $f_0, f_1 \leq \nu_1$ and suppose further that the weight $a$, which is supported on $[M]$,
satisfies $|a| \leq \nu_2$. We shall show that, under an additional assumption on $\nu_1, \nu_2$, we can prove
the estimate

\[ E = \int_X \mathbb{E}_{m \in [M]}\, a(m) f_0(x)\, T^{m^2} f_1(x) = O\!\left( \|a\|_{U^3(\mathbb{Z}_{7M})} \right) + o(1). \tag{5} \]

Let us first eliminate $f_0$ from the average $E$. By Cauchy-Schwarz, we have

\[ E^2 \leq \int_X \nu_1(x) \int_X \nu_1(x) \left| \mathbb{E}_{m \in [M]}\, a(m)\, T^{m^2} f_1(x) \right|^2. \]



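Recalling that $\int_X \nu_1 = 1 + o(1)$ and using van der Corput, we have

\[ E^2 \ll (1 + o(1)) \int_X \mathbb{E}_{m \in [M],\, |h| < M}\, a(m) a(m + h)\, \nu_1\, T^{m^2} f_1\, T^{(m+h)^2} f_1 + o(1) \]
\[ \phantom{E^2} = (1 + o(1)) \int_X \mathbb{E}_{m \in [M],\, |h| < M}\, a(m) a(m + h)\, T^{-m^2} \nu_1\, f_1\, T^{2mh + h^2} f_1 + o(1), \]

where in the second line we shift the variable $x$ by $R_1(m) = -m^2$, thus making the term $f_1$
appear, rather than a shift of $f_1$. Let

\[ E_1 = \int_X \mathbb{E}_{m \in [M],\, |h| < M}\, a(m) a(m + h)\, T^{R_1(m)} \nu_1\, f_1\, T^{2mh + h^2} f_1. \]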

Note that the system of shifts of $f_1$ appearing in $E_1$, namely $0$, $2mh + h^2$, is "simpler" than
the system in $E$ in the sense that the polynomials are now linear in $m$. Next, we want to
eliminate the new shift of $f_1$ from $E_1$. Again, by Cauchy-Schwarz, we have

\[ E_1^2 \leq \int_X \nu_1(x) \int_X \nu_1(x) \left| \mathbb{E}_{m \in [M],\, |h| < M}\, a(m) a(m + h)\, T^{R_1(m)} \nu_1\, T^{2mh + h^2} f_1 \right|^2 \]

and by van der Corput this is

\[ \ll (1 + o(1)) \int_X \nu_1\, \mathbb{E}_{m \in [M],\, |h|, |k| < M}\, a(m) a(m + k) a(m + h) a(m + h + k) \]
\[ \qquad \times\, T^{R_1(m)} \nu_1\, T^{R_1(m+k)} \nu_1\, T^{2mh + h^2} f_1\, T^{2(m+k)h + h^2} f_1. \]

Let the last integral be $E_2$. Shifting $x$ by $R_2(m, h) = -(2mh + h^2)$ to make $f_1$ appear, we
obtain

\[ E_2 = \int_X \mathbb{E}_{m \in [M],\, |h|, |k| < M}\, a(m) a(m + k) a(m + h) a(m + h + k) \]
\[ \qquad \times\, T^{R_2(m,h)} \nu_1\, T^{R_1(m) + R_2(m,h)} \nu_1\, T^{R_1(m+k) + R_2(m,h)} \nu_1\, f_1\, T^{2kh} f_1. \]



Again, the system of shifts of $f_1$ is now simpler than the previous one, in that it does not
depend on $m$ at all. (Tao and Ziegler deduced from this step their generalized von Neumann
inequality, which bounds $E$ in terms of an averaged local Gowers norm of $f_1$.) We repeat
the same process one more time to eliminate $f_1$ completely from the average. Indeed, by
Cauchy-Schwarz, $E_2^2$ is less than or equal to

\[ \mathbb{E}_{|h|, |k| < M} \int_X \nu_1\, T^{2kh} \nu_1 \times \Big( \mathbb{E}_{|h|, |k| < M} \int_X \nu_1\, T^{2kh} \nu_1\, \Big| \mathbb{E}_{m \in [M]}\, a(m) a(m + k) a(m + h) a(m + h + k) \]
\[ \qquad \times\, T^{R_2(m,h)} \nu_1\, T^{R_1(m) + R_2(m,h)} \nu_1\, T^{R_1(m+k) + R_2(m,h)} \nu_1 \Big|^2 \Big). \]



Since $\nu_1$ satisfies the polynomial forms condition [11, Definition 3.6], the first factor is $1 + o(1)$.
By van der Corput, the second factor is at most

\[ \mathbb{E}_{m \in [M],\, |h|, |k|, |l| < M} \prod_{\omega \in \{0,1\}^3} a(m + \omega \cdot (l, k, h)) \int_X T^{R_2(m,h)} \nu_1\, T^{R_1(m) + R_2(m,h)} \nu_1\, T^{R_1(m+k) + R_2(m,h)} \nu_1 \]
\[ \qquad \times\, T^{R_2(m+l,h)} \nu_1\, T^{R_1(m+l) + R_2(m+l,h)} \nu_1\, T^{R_1(m+k+l) + R_2(m+l,h)} \nu_1. \]

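If it were not for the presence of the integral, then this would be equal to

\[ \mathbb{E}_{m \in [M],\, |h|, |k|, |l| < M} \prod_{\omega \in \{0,1\}^3} a(m + \omega \cdot (l, k, h)), \]

which would give us the desired estimate, since the latter quantity is bounded above by a
constant times $\|a\|_{U^3(\mathbb{Z}_{7M})}^8$. Indeed, we trivially have

\[ \|a\|_{U^3(\mathbb{Z}_{7M})}^8 = \frac{1}{(7M)^4} \sum_{m, h, k, l \in \mathbb{Z}_{7M}} \prod_{\omega \in \{0,1\}^3} a(m + \omega \cdot (l, k, h)). \]

Identifying $\mathbb{Z}_{7M}$ with the integers in $(-3M, 4M]$, we see that since $a$ is supported on $[M]$, the
term $\prod_{\omega \in \{0,1\}^3} a(m + \omega \cdot (l, k, h))$ is non-zero only if $m \in [M]$ and $|h|, |k|, |l| < M$. For these
$m, h, k, l$, the representative of $m + \omega \cdot (l, k, h)$ in $(-3M, 4M]$ is $m + \omega \cdot (l, k, h)$ itself, for any
$\omega \in \{0, 1\}^3$. Thus

\[ \|a\|_{U^3(\mathbb{Z}_{7M})}^8 \gg \mathbb{E}_{m \in [M],\, |h|, |k|, |l| < M} \prod_{\omega \in \{0,1\}^3} a(m + \omega \cdot (l, k, h)), \]

where $m, h, k, l$ are now elements of $\mathbb{Z}$, and the claimed bound follows after renormalization.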
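
The wrap-around argument above can be tested directly for a tiny value of $M$. The sketch below is illustrative only; the helper names and the sample weight `a` are assumptions. It computes $\|a\|_{U^3(\mathbb{Z}_{7M})}^8$ by brute force and compares $(7M)^4$ times this value with the unnormalized sum over the integer box $m \in [M]$, $|h|, |k|, |l| < M$.

```python
from fractions import Fraction
from itertools import product

CUBE = list(product((0, 1), repeat=3))  # the eight vertices omega in {0,1}^3

def u3_norm_pow8(a, n):
    """Brute-force ||a||_{U^3(Z_n)}^8 for a real weight a given as {residue: value}."""
    total = 0
    for m, h, k, l in product(range(n), repeat=4):
        term = 1
        for w in CUBE:
            term *= a.get((m + w[0] * l + w[1] * k + w[2] * h) % n, 0)
            if term == 0:
                break
        total += term
    return Fraction(total, n ** 4)

def box_sum(a, M):
    """Sum over integers m in [M], |h|,|k|,|l| < M of prod_omega a(m + omega.(l,k,h)),
    where a is supported on [M] and no reduction modulo anything takes place."""
    total = 0
    for m in range(1, M + 1):
        for h, k, l in product(range(-M + 1, M), repeat=3):
            term = 1
            for w in CUBE:
                term *= a.get(m + w[0] * l + w[1] * k + w[2] * h, 0)
            total += term
    return total

M = 2
a = {1: 2, 2: 3}  # assumed sample weight supported on [M]
print(u3_norm_pow8(a, 7 * M) * (7 * M) ** 4 == box_sum(a, M))
```

The modulus $7M$ matters here: it is large enough that no vertex of a parallelepiped contributing to the sum can wrap around.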
To continue, let us write $\vec{m} = (m, h, k, l) \in \mathbb{Z}^4$, the integral as $\int_X \prod_{i=1}^{6} T^{Q_i(\vec{m})} \nu_1$, where
$Q_i \in \mathbb{Z}[\vec{m}]$ for $i = 1, \dots, 6$, and the weight in front of the integral as $\prod_{j=1}^{8} a(L_j(\vec{m}))$, where
for $j = 1, \dots, 8$ the $L_j$ are linear forms defining the eight vertices of the parallelepiped in the
$U^3$ norm. Also, let $\Omega_M = \{(m, h, k, l) \in \mathbb{Z}^4 : m \in [M],\ |h|, |k|, |l| < M\}$. We want to show
that

\[ F = \mathbb{E}_{\vec{m} \in \Omega_M} \prod_{j=1}^{8} a(L_j(\vec{m})) \times \left( \int_X \prod_{i=1}^{6} T^{Q_i(\vec{m})} \nu_1(x) - 1 \right) = o(1). \tag{6} \]

Recalling that $|a| \leq \nu_2$, by Cauchy-Schwarz we have that

\[ F^2 \ll \|\nu_2\|_{U^3(\mathbb{Z}_{7M})}^8\ \mathbb{E}_{\vec{m} \in \Omega_M} \prod_{j=1}^{8} \nu_2(L_j(\vec{m})) \left( \int_X \prod_{i=1}^{6} T^{Q_i(\vec{m})} \nu_1(x) - 1 \right)^2. \]

Since $\nu_2$ is pseudorandom, we have $\|\nu_2\|_{U^3(\mathbb{Z}_{7M})} = 1 + o(1)$. (Note that a priori $\nu_2$ is a
pseudorandom measure with respect to $N$, but by choosing $R$, the sieve level in the definition
of $\nu_2$, sufficiently small, we can ensure that $\nu_2$ is also pseudorandom with respect to $M$.)
By squaring out $\left( \int_X \prod_{i=1}^{6} T^{Q_i(\vec{m})} \nu_1(x) - 1 \right)^2$, we see that it suffices to show that

\[ \mathbb{E}_{\vec{m} \in \Omega_M} \prod_{j=1}^{8} \nu_2(L_j(\vec{m})) \times \left( \int_X \prod_{i=1}^{6} T^{Q_i(\vec{m})} \nu_1(x) \right)^k = 1 + o(1) \tag{7} \]

for $k = 0, 1, 2$. But it is precisely expressions of this type that will be governed by our new
"extra condition", which we shall formally introduce in the next section.

Remark 4.1. It is well known that if $\nu_2$ is a pseudorandom measure, then so is $(\nu_2 + 1)/2$. It
will be easy to see that if the pair $\nu_1, \nu_2$ satisfies the extra condition, then so does the pair
$\nu_1, (\nu_2 + 1)/2$. The above result therefore also applies to the case where $|a| \leq \nu_2 + 1$, which
is what we need in the proof of Theorem 4.



5. A DISCUSSION OF THE POLYNOMIAL FORMS CONDITION 

Let us recall Tao and Ziegler's definition of the polynomial forms condition [11, Definition
3.6], of which the extra condition will be a variant.

Definition 1 (Polynomial forms condition). A measure $\nu : X \to \mathbb{R}^+$ is said to satisfy the
polynomial forms condition if for any family of polynomials $Q_j \in \mathbb{Z}[m_1, \dots, m_D]$, $j \in J$,
satisfying

• the difference $Q_i - Q_j$ is not constant for $i \neq j$;

• the number of polynomials $|J|$ and the number of variables $D$ are bounded by $1/\eta_1$;

• the total degree of each $Q_j$ for $j \in J$ is at most $d_0$, and all coefficients are at most
$CW^{d_0}$, where $C$ is a constant (depending on $P_1, \dots, P_k$);

we have

\[ \mathbb{E}_{\vec{h} \in \Omega \cap \mathbb{Z}^D} \int_X \prod_{j \in J} T^{Q_j(\vec{h})} \nu = 1 + o_\varepsilon(1) \tag{8} \]

for any convex body $\Omega \subset \mathbb{R}^D$ of inradius at least $N^{\varepsilon}$ and contained in the ball $B(0, M^2)$.

Inequality (7) does not exactly follow from Definition 1, but we still can deduce it using
Tao and Ziegler's machinery. We make the following general definition.

Definition 2 (Extra condition). A pair of measures $\nu_1, \nu_2 : X \to \mathbb{R}^+$ is said to satisfy the
extra condition if for any family of polynomials $Q_j \in \mathbb{Z}[m_1, \dots, m_D]$, $j \in J_1$, and any family
of linear forms $L_j : \mathbb{Z}^D \to \mathbb{Z}$, $j \in J_2$, satisfying

• the difference of polynomials $Q_i - Q_j$ is not constant for $i \neq j$;

• the number of polynomials $|J_1|$ and the number of variables $D$ are bounded by $1/\eta_1$;

• the total degree of each $Q_j$ for $j \in J_1$ is at most $d_0$, and all coefficients are at most
$CW^{d_0}$, where $C$ is a constant (depending on $P_1, \dots, P_k$);

• the linear forms $L_j$, $j \in J_2$ are pairwise linearly independent;

• the number of linear forms $|J_2|$ is bounded by $1/\eta_1$;

• the coefficients of each $L_j$ are 0 or 1;

we have

\[ \mathbb{E}_{\vec{m} \in \Omega_{M,D}} \prod_{j \in J_2} \nu_2(L_j(\vec{m})) \left( \int_X \prod_{j \in J_1} \nu_1(x + Q_j(\vec{m})) \right)^k = 1 + o(1) \tag{9} \]

for $k = 0, 1, 2$, where $\Omega_{M,D} = \{(m, h_1, \dots, h_{D-1}) \in \mathbb{Z}^D : m \in [M],\ |h_i| < M \text{ for any } i = 1, \dots, D - 1\}$.

Remark 5.1. The extra condition is tailor-made to suit our needs. One could merge it with the 
polynomial forms condition to obtain a more general statement, but this appears unnecessary. 

Let us now show that for any $b \neq 0$ coprime to $W$, the pair $\nu_1 = \nu_{W,b}$, $\nu_2 = \nu_{W,1}$ given by
(2) satisfies the extra condition. That is, we shall turn to proving Proposition 2. To begin
with, let us recall some more definitions from [11].

Definition 3 (Good, bad and terrible primes). Let $P_j \in \mathbb{Z}[x_1, \dots, x_D]$, $j \in J$, be a family of
polynomials. We say a prime $p$ is good with respect to the family $P_j$, $j \in J$, if

• the polynomials $P_j \pmod{p}$, $j \in J$, (considered as elements of $\mathbb{F}_p[x_1, \dots, x_D]$) are
pairwise coprime;

• for each $j \in J$, there is a variable $x_i$ such that $P_j$ can be expressed as $P_j = P_{j,1} x_i + P_{j,0}$
where $P_{j,1}, P_{j,0} \in \mathbb{F}_p[x_1, \dots, x_{i-1}, x_{i+1}, \dots, x_D]$ are such that $P_{j,1}$ is non-zero and
coprime to $P_{j,0}$.

We say $p$ is bad if it is not good. We say $p$ is terrible if at least one of the $P_j$ vanishes
identically mod $p$.

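We shall need the following slight variant of the basic correlation estimate [11, Proposition
10.1], in which we now have a pair of pseudorandom measures.

Proposition 3 (Correlation estimate). Write $\nu_1 = \nu_{W,b_1}$ and $\nu_2 = \nu_{W,b_2}$ with $b_1, b_2 \neq 0$ and
coprime to $W$. Let $J_1, J_2 \subset \mathbb{N}$ be two disjoint indexing sets, and let $J = J_1 \cup J_2$. For $j \in J$,
let $P_j \in \mathbb{Z}[x_1, \dots, x_D]$ have degree at most $d$. Let $\Omega$ be a convex body in $\mathbb{R}^D$ of inradius
at least $R^{4|J|+1}$. Let $\mathcal{P}_b$ be the set of primes $w \leq p \leq R^{\log R}$ which are bad with respect to
$((WP_j + b_1)_{j \in J_1}, (WP_j + b_2)_{j \in J_2})$, and suppose that there are no terrible primes in the same
range. Then

\[ \mathbb{E}_{\vec{x} \in \Omega \cap \mathbb{Z}^D} \prod_{j \in J_1} \nu_1(P_j(\vec{x})) \prod_{j \in J_2} \nu_2(P_j(\vec{x})) = 1 + o_{D,J,d}(1) + O_{D,J,d}\!\left( \mathrm{Exp}\!\left( O_{D,J,d}\!\left( \sum_{p \in \mathcal{P}_b} p^{-1/2} \right) \right) \right). \tag{10} \]

Here we have written $\mathrm{Exp}(x) = \max(e^x - 1, 0)$, so that $\mathrm{Exp}(x) \sim x$ when $0 \leq x \ll 1$.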

The proof of [11, Proposition 10.1] generalizes readily to yield Proposition 3. However,
since it is relatively complex and buried in various appendices of a long paper, we give the
details for the convenience of the reader in the appendix to this paper.

To conclude, let us deduce the extra condition from the above correlation estimate.

Proof of Proposition 2 assuming Proposition 3. Recall that we want to show that

\[ \mathbb{E}_{\vec{m} \in \Omega_{M,D}} \prod_{j \in J_2} \nu_2(L_j(\vec{m})) \left( \int_X \prod_{j \in J_1} \nu_1(x + Q_j(\vec{m})) \right)^k = 1 + o(1) \tag{11} \]

for $k = 0, 1, 2$, where $\nu_1, \nu_2$ are defined as in Proposition 3. If $k = 0$, then the integral
disappears, and we are left to show that

\[ \mathbb{E}_{\vec{m} \in \Omega_{M,D}} \prod_{j \in J_2} \nu_2(L_j(\vec{m})) = 1 + o(1). \]

If we choose $R$ sufficiently small in terms of $M$, then $\nu_2$ is pseudorandom with respect to $M$,
and (11) simply follows from Green and Tao's linear forms condition in [5, Definition 3.1] in
this case.

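Let us now discuss the case $k = 2$ (the case $k = 1$ is even simpler). First we make
a reduction, replacing the integral on $X$ with the average $\mathbb{E}_{x \in [N]}$, thus regarding $\nu_1$ as a
function on $\mathbb{Z}$ rather than $X$. The values of $\nu_1(x + Q_j(\vec{m}))$ may be different when $\nu_1$ is
regarded as a function on $\mathbb{Z}$ because of the wrap-around effect, but they must agree whenever
$1 \leq x \leq N - O((WM)^{d_0})$. Recall that we also have the bound $\nu_1 \ll_\varepsilon N^\varepsilon$ for any $\varepsilon > 0$. Thus

\[ \int_X \prod_{j \in J_1} \nu_1(x + Q_j(\vec{m})) - \mathbb{E}_{x \in [N]} \prod_{j \in J_1} \nu_1(x + Q_j(\vec{m})) \ll_\varepsilon (WM)^{d_0} N^{\varepsilon - 1} \tag{12} \]

for any $\varepsilon > 0$, and (11) follows if we can show that

\[ \mathbb{E}_{\vec{m} \in \Omega_{M,D}} \prod_{j \in J_2} \nu_2(L_j(\vec{m})) \left( \mathbb{E}_{x \in [N]} \prod_{j \in J_1} \nu_1(x + Q_j(\vec{m})) \right)^2 = 1 + o(1). \tag{13} \]

Expanding out (13), we see that it is equivalent to

\[ \mathbb{E}_{\vec{m} \in \Omega_{M,D},\, x, x' \in [N]} \prod_{j \in J_2} \nu_2(L_j(\vec{m})) \prod_{j \in J_1} \nu_1(x + Q_j(\vec{m}))\, \nu_1(x' + Q_j(\vec{m})) = 1 + o(1). \tag{14} \]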

Now this expression falls within the scope of Proposition 3: the polynomials in question are
$L_j(\vec{m})$, $x + Q_j(\vec{m})$, $x' + Q_j(\vec{m})$, in variables $\vec{m}, x, x'$. For this system, there is no terrible prime
greater than $w$, and the only bad primes greater than $w$ are those dividing $Q_j - Q_{j'}$ for some
$j \neq j'$. Therefore, the left hand side of (14) equals $1 + o(1) + O\!\left( \mathrm{Exp}\!\left( O\!\left( \sum_{p \in \mathcal{P}_b} p^{-1/2} \right) \right) \right)$,
where $\mathcal{P}_b$ denotes the set of primes dividing $Q_j - Q_{j'}$ for some $j \neq j'$.

But just as in the proof of [11, Corollary 11.2], if a prime $p$ divides $Q_j - Q_{j'}$ for some $j \neq j'$,
then $p$ must divide a non-zero difference of the coefficients of the $Q_j$s (recall that $Q_j - Q_{j'}$
are not constant for any $j \neq j'$). These coefficients are bounded by $O(W^{d_0})$, so that the total
product of such $p$ is at most $O(W^{O(1)})$. As a result, the number of $p \in \mathcal{P}_b$ (which are greater
than $w$) is at most $\log(O(W^{O(1)})) / \log w = o(\log W)$. But then $\sum_{p \in \mathcal{P}_b} p^{-1/2} \leq \sum_{p \in \mathcal{P}_b} w^{-1/2} =
o(1)$, since $\log W \ll w$. □

Remark 5.2. The extra condition is in a sense simpler than Tao and Ziegler's full polynomial
forms condition, in that all the variables are at scale $M$, whereas the polynomial forms
condition ([11, Theorem 11.1]) makes a statement about convex bodies of inradius as small
as $N^{\varepsilon}$. This explains why Tao and Ziegler had to do some extra work to prove the polynomial
forms condition from the correlation estimate, while for us it is almost immediate.



6. The general case 

As is to be expected, we proceed by PET induction to prove Proposition 1 in the general
case. We follow the notation in [3] with the simplification that the dimension is equal to 1.

Given a family of polynomials $Q = (q_1, \dots, q_k)$ in a variable $n$ (and possibly in other
variables), the maximum of the degrees of the $q_i$ (with respect to $n$) is called the degree of
the family $Q$. We work with families of polynomials whose degree is smaller than or equal to
a fixed number $s$.

For $i = 1, \dots, k$, define $Q'$ to be the possibly empty set

\[ Q' = \{ q_i \in Q : q_i \text{ is constant in } n \}. \]

Two polynomials are said to be equivalent if they have the same degree and the same leading
coefficient (in $n$). For $j = 1, \dots, s$, let $w_j$ denote the number of distinct non-equivalent classes
of polynomials of degree $j$ in $Q \setminus Q'$. Finally, define the type of the family $Q$ to be the vector
$(w_1, \dots, w_s)$. A family is said to be of type zero if all the $w_j$ are zero, in which case all
the polynomials are constant (in $n$). The set of types can be ordered lexicographically, by
stipulating that $w = (w_1, \dots, w_s) < w' = (w'_1, \dots, w'_s)$ if there exists $d$ such that $w_d < w'_d$
and $w_j = w'_j$ for all $j > d$.

It follows that any decreasing sequence of types of families of polynomials is eventually
stationary, and thus any inductive process that reduces the type must eventually stop.

Given a family $Q = (q_1, \dots, q_k)$, $q \in \mathbb{Z}[t]$ and $h \in \mathbb{N}$, define following [3] the van der Corput
operation $(q, h)$-vdC$(Q)$ by setting

\[ (q, h)\text{-vdC}(Q) = (S_h Q - q,\ Q - q), \]

where $S_h q(n) = q(n + h)$, $S_h Q = (S_h q_1, \dots, S_h q_k)$ and $Q - q = (q_1 - q, \dots, q_k - q)$.
The crucial observation is the following [1, 3].

Lemma 3. Let $Q$ be a family of polynomials of non-zero type. Then there exists $q \in Q \setminus Q'$
such that for all $h \in \mathbb{N}$, the family $(q, h)$-vdC$(Q \setminus Q')$ has strictly smaller type than $Q$.

For example, in the toy example in Section 4 we passed from type (1, 0, 1) to (1, 1, 0) to
(2, 0, 0).
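
The type and the van der Corput operation can be experimented with on one-variable families. The sketch below is a simplification and not the setup of the proof: polynomials are coefficient tuples in the single variable $n$, the shift $h$ is a fixed integer rather than a new variable, and all function names are hypothetical. The type is computed as the vector $(w_1, \dots, w_s)$ defined in the text.

```python
from math import comb

def trim(p):
    """Drop trailing zero coefficients; p lists coefficients from degree 0 upwards."""
    p = list(p)
    while p and p[-1] == 0:
        p.pop()
    return tuple(p)

def shift(p, h):
    """Coefficients of the polynomial n -> p(n + h)."""
    out = [0] * len(p)
    for i, c in enumerate(p):
        for j in range(i + 1):
            out[j] += c * comb(i, j) * h ** (i - j)
    return trim(out)

def sub(p, q):
    """Coefficients of p - q."""
    n = max(len(p), len(q))
    p = list(p) + [0] * (n - len(p))
    q = list(q) + [0] * (n - len(q))
    return trim(x - y for x, y in zip(p, q))

def poly_type(Q, s):
    """(w_1, ..., w_s): w_j counts the classes (degree j, leading coefficient)
    among the polynomials of Q that are non-constant in n."""
    trimmed = [trim(p) for p in Q]
    classes = {(len(p) - 1, p[-1]) for p in trimmed if len(p) > 1}
    return tuple(sum(1 for d, _ in classes if d == j) for j in range(1, s + 1))

def vdc(Q, q, h):
    """The family (S_h Q - q, Q - q)."""
    return [sub(shift(p, h), q) for p in Q] + [sub(p, q) for p in Q]

def smaller(w, w2):
    """True if w < w2 in the ordering above (highest degree compared first)."""
    return tuple(reversed(w)) < tuple(reversed(w2))

Q0 = [(0, 0, 1)]                  # the single polynomial n^2
Q1 = vdc(Q0, (0, 0, 1), 1)        # [(n+1)^2 - n^2, 0] = [2n + 1, 0]
Q2 = vdc([(1, 2)], (1, 2), 1)     # applied to the non-constant part of Q1
print(poly_type(Q0, 2), poly_type(Q1, 2), poly_type(Q2, 2))
```

In the proof itself each new shift $h$ is kept as a variable, so the families become multivariate; fixing $h$ numerically, as done here, only tracks the behaviour in $n$.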

In the van der Corput operation, we focus on a single variable $n$. However, we also need
to keep in mind that the polynomials in $Q$ and $(q, h)$-vdC$(Q)$ are multivariate, with a new
variable being introduced at each step of the van der Corput operation. This is important
when verifying the hypotheses of the (polynomial forms or extra) condition as we apply them
to the averages arising throughout.

Proof of Proposition 1. Let us recall that we have functions $f_i$, $i = 1, \dots, k$ satisfying $|f_i| \leq \nu_1$,
and a weight $a$ supported on $[M]$ satisfying $|a| \leq \nu_2$. We start with the average

\[ E = \mathbb{E}_{m \in [M]} \int_X a(m) \prod_{q_i \in Q} T^{q_i(m)} f_i, \]

where the family $Q$ initially consists of the polynomials $q_i(m) = P_i(Wm)/W$, $i = 1, \dots, k$.
Define $Q' \subset Q$ to be the subset of polynomials which are constant in the variable $m$. (By
hypothesis on the $P_i$, this actually means that the $q \in Q'$ are identically 0, but later on, with
additional variables, this is not necessarily the case.)
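We first apply the Cauchy-Schwarz inequality to obtain

\[ |E|^2 \leq \int_X \prod_{q_i \in Q'} T^{q_i} \nu_1 \times \int_X \prod_{q_i \in Q'} T^{q_i} \nu_1 \left| \mathbb{E}_{m \in [M]}\, a(m) \prod_{q_i \in Q \setminus Q'} T^{q_i(m)} f_i \right|^2. \]

By the properties of $\nu_1$, the first integral is $1 + o(1)$ (if $Q' = \emptyset$, we interpret the empty product
as equal to 1), and we can bound the second factor by van der Corput by a constant times

\[ \int_X \prod_{q \in Q'} T^{q} \nu_1\ \mathbb{E}_{m \in [M],\, |h_1| < M}\, a(m) a(m + h_1) \prod_{q_i \in Q \setminus Q'} T^{q_i(m)} f_i\, T^{q_i(m + h_1)} f_i + o(1). \]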
Now by Lemma 3 there exists $q^{(1)} \in Q \setminus Q'$ such that for all $h_1$, the family $Q_1 := (q^{(1)}, h_1)$-vdC$(Q \setminus Q')$
has strictly smaller type than $Q$. We also write $Q_1^{\dagger} := Q' - q^{(1)}$ for the recently deceased
nodes. With this notation, shifting by $q^{(1)}$ gives, up to an error term, the expression

\[ E_1 = \int_X \mathbb{E}_{m \in [M],\, |h_1| < M}\, a(m) a(m + h_1) \prod_{q \in Q_1^{\dagger}} T^{q} \nu_1 \prod_{q_i \in Q_1} T^{q_i} f_{j_i}, \]

where the $f_{j_i}$ belong to the set $\{f_1, \dots, f_k\}$. There is no need to keep track of them, so we
shall simply write $f$ with no subscript in the sequel.

Write $Q_1'$ for those polynomials in $Q_1$ of degree 0 in $m$. By Cauchy-Schwarz,

By the polynomial forms condition, the first factor is $1 + o(1)$, and the second can be bounded
above by van der Corput as a constant times

/ E m£[M], a{m)a{m + hi)a{m + h2)a{m + hi + h2) 

Jx |/ii|,|/i2|<'m 

q&Ql qeQi\Q[ 

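Of course the dependence of the polynomials on $h_1$ is suppressed here. By Lemma 3 there is
$q^{(2)} \in Q_1 \setminus Q_1'$ such that the family $Q_2 := (q^{(2)}, h_2)$-vdC$(Q_1 \setminus Q_1')$ has strictly smaller type.
Define also $Q_2^{\dagger} := (Q_1' - q^{(2)}) \cup (Q_1^{\dagger} - q^{(2)})$, leading after rearranging to

\[ E_2 = \int_X \mathbb{E}_{m \in [M],\, |h_1|, |h_2| < M}\, a(m) a(m + h_1) a(m + h_2) a(m + h_1 + h_2) \prod_{q \in Q_2^{\dagger}} T^{q} \nu_1 \prod_{q \in Q_2} T^{q} f. \]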

Continuing in this vein, we set at step $s$ $Q_{s+1} := (q^{(s+1)}, h_{s+1})$-vdC$(Q_s \setminus Q_s')$ and $Q_{s+1}^{\dagger} :=
(Q_s' - q^{(s+1)}) \cup (Q_s^{\dagger} - q^{(s+1)})$, all the while reducing the type. We also set $Q_0 = Q$, $Q_0^{\dagger} = \emptyset$, so
that the above recursive definition is valid for all $s \geq 0$. By Lemma 3 we reach a point, at step
$t$ say, where $Q_t = Q_t'$. In other words, the system is of zero type and all active polynomials
are of degree 0 in $m$.

In order to be able to use the polynomial forms condition at every step, we need to ensure 
that no two polynomials in differ by constants. We will in fact prove a slightly stronger 
statement, which we shall need later. 

Claim 1. For any < s <t, no two polynomials in Qs U QI differ by constants. 

Proof of Claim 1. It is easy to see that all our polynomials are 0 when all the variables are 0.
Therefore, it suffices to show that all the polynomials in $\mathcal{Q}_s \cup \mathcal{Q}''_s$ are distinct as multivariate
polynomials. We prove this by induction on $s$. When $s = 0$, this follows from our assumption
that the polynomials $P_j$ are distinct. Suppose we know already that all the polynomials in
$\mathcal{Q}_s \cup \mathcal{Q}''_s$ are distinct. By definition, the family $\mathcal{Q}_{s+1} \cup \mathcal{Q}''_{s+1}$ is obtained by subtracting $q^{(s+1)}$
from all the polynomials in the family $\mathrm{Sh}_{h_{s+1}}(\mathcal{Q}_s \setminus \mathcal{Q}'_s) \cup (\mathcal{Q}_s \setminus \mathcal{Q}'_s) \cup \mathcal{Q}'_s \cup \mathcal{Q}''_s$. Thus it suffices
to show that all polynomials in the latter family are distinct. Recall that the polynomials
in $\mathcal{Q}_s \setminus \mathcal{Q}'_s$ are distinct and non-constant in $m$. Thus the polynomials in $\mathrm{Sh}_{h_{s+1}}(\mathcal{Q}_s \setminus \mathcal{Q}'_s)$ are
distinct from each other and from the rest, since they have a new variable, namely $h_{s+1}$, and
are non-constant in this variable. The remaining polynomials from $(\mathcal{Q}_s \setminus \mathcal{Q}'_s) \cup \mathcal{Q}'_s \cup \mathcal{Q}''_s$ are
distinct by induction hypothesis since $(\mathcal{Q}_s \setminus \mathcal{Q}'_s) \cup \mathcal{Q}'_s = \mathcal{Q}_s$. □
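
The recursion and the distinctness statement of Claim 1 can be checked mechanically on small examples. The following is a minimal sketch and not the authors' construction: it assumes that $(q, h)$-vdC$(\mathcal{A})$ means $(\mathrm{Sh}_h \mathcal{A} \cup \mathcal{A}) - q$, as used in the proof above, and it picks $q^{(s)}$ as an active polynomial of minimal degree in $m$, which is an assumption standing in for the choice made in Lemma 3. Polynomials in $m, h_1, \dots, h_5$ are stored as dictionaries from exponent tuples to coefficients; all function names are hypothetical.

```python
from math import comb

NVARS = 6  # variables (m, h_1, ..., h_5); an arbitrary cap for this toy example


def mono(e_m, coeff=1):
    """The polynomial coeff * m**e_m, as {exponent tuple: coefficient}."""
    return {(e_m,) + (0,) * (NVARS - 1): coeff}


def sub(p, q):
    """Return p - q, dropping monomials whose coefficient cancels to zero."""
    r = dict(p)
    for e, c in q.items():
        r[e] = r.get(e, 0) - c
        if r[e] == 0:
            del r[e]
    return r


def shift(p, i):
    """Substitute m -> m + h_i, expanding each power of m binomially."""
    r = {}
    for e, c in p.items():
        a = e[0]
        for b in range(a + 1):
            f = list(e)
            f[0] = b
            f[i] += a - b
            r[tuple(f)] = r.get(tuple(f), 0) + c * comb(a, b)
    return {e: c for e, c in r.items() if c != 0}


def deg_m(p):
    """Degree in m (0 for polynomials constant in m, including 0)."""
    return max((e[0] for e in p), default=0)


def vdc_run(polys):
    """Return [(Q_0, Q''_0), (Q_1, Q''_1), ...] until no polynomial is active."""
    Q, dead = [dict(p) for p in polys], []
    history = [(Q, dead)]
    s = 0
    while any(deg_m(p) > 0 for p in Q):
        s += 1
        if s >= NVARS:
            raise RuntimeError("ran out of shift variables; increase NVARS")
        active = [p for p in Q if deg_m(p) > 0]      # Q_{s-1} \ Q'_{s-1}
        inactive = [p for p in Q if deg_m(p) == 0]   # Q'_{s-1}
        # Assumed choice of q^{(s)}: minimal degree in m, deterministic tie-break.
        q = min(active, key=lambda p: (deg_m(p), sorted(p.items())))
        dead = [sub(p, q) for p in inactive + dead]  # Q''_s
        Q = [sub(p, q) for p in active] + [sub(shift(p, s), q) for p in active]
        history.append((Q, dead))
    return history


def all_distinct(family):
    """True if no polynomial occurs twice in the list."""
    keys = [frozenset(p.items()) for p in family]
    return len(keys) == len(set(keys))


history = vdc_run([mono(1), mono(1, 2)])  # start from Q = {m, 2m}
print(len(history) - 1, all(all_distinct(Q + dead) for Q, dead in history))
# prints: 2 True
```

For the starting family $\{m, 2m\}$ the process stops after two steps, and at every step the polynomials in $\mathcal{Q}_s \cup \mathcal{Q}''_s$ are pairwise distinct, in line with Claim 1.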

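Claim 1 shows that our calculations so far have been valid. Returning to step $t$, we have

$$E_t = \int_X \mathbb{E}_{m \in [M],\, |h_1|, \dots, |h_t| < M} \prod_{\omega \in \{0,1\}^t} a(m + \omega \cdot (h_1, \dots, h_t)) \prod_{q \in \mathcal{Q}''_t} T^{q} \nu_1 \prod_{q \in \mathcal{Q}'_t} T^{q} f,$$

and we apply Cauchy-Schwarz one more time to get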



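$$|E_t| \leq \Big( \int_X \mathbb{E}_{|h_1|, \dots, |h_t| < M} \prod_{q \in \mathcal{Q}'_t} T^{q} \nu_1 \Big)^{1/2} \Big( \int_X \mathbb{E}_{|h_1|, \dots, |h_t| < M} \prod_{q \in \mathcal{Q}'_t} T^{q} \nu_1 \, \Big| \mathbb{E}_{m \in [M]} \prod_{\omega \in \{0,1\}^t} a(m + \omega \cdot (h_1, \dots, h_t)) \prod_{q \in \mathcal{Q}''_t} T^{q} \nu_1 \Big|^2 \Big)^{1/2}.$$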



A final van der Corput gives 



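$$\mathbb{E}_{m \in [M],\, |h_1|, \dots, |h_{t+1}| < M} \prod_{\omega \in \{0,1\}^{t+1}} a(m + \omega \cdot (h_1, \dots, h_{t+1})) \int_X \prod_{q \in \mathcal{Q}'_t} T^{q} \nu_1 \prod_{q \in \mathcal{Q}''_t} T^{q(m)} \nu_1 \, T^{q(m + h_{t+1})} \nu_1,$$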



and we write $F$ for the difference between this expression and $\mathbb{E}_{m \in [M],\, |h_1|, \dots, |h_{t+1}| < M} \prod_{\omega \in \{0,1\}^{t+1}} a(m + \omega \cdot (h_1, \dots, h_{t+1}))$. As in the earlier example, by Cauchy-Schwarz we have that $F^2$ is
bounded above by $\|\nu_2\|_{U^{t+1}}^{2^{t+1}}$ times the average



(15) 



E 



^(2t+3)M; 

'm,hi,-,ht+ie[M] n i'2{m + uj ■ {hi,...,ht+i)) 

a;e{0,l}t+l 

/ 



In order to use the extra condition on the last expression, we need to verify that no two
polynomials in $\mathcal{Q}''_t \cup \mathrm{Sh}_{h_{t+1}} \mathcal{Q}''_t \cup \mathcal{Q}'_t$ differ by constants. Again, since all these polynomials are
0 when evaluated at 0, it suffices to show that they are distinct. Before seeing this, let us
make some observations.



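Claim 2. For any $0 \leq s \leq t$, for any polynomials $p \in \mathcal{Q}''_s$ and $q \in \mathcal{Q}_s \setminus \mathcal{Q}'_s$, $p - q$ is not
constant in $m$.

Proof of Claim 2. For $s = 0$ there is nothing to prove. Suppose the claim is true for some $s \leq t - 1$.
Let $p \in \mathcal{Q}''_{s+1}$ and $q \in \mathcal{Q}_{s+1} \setminus \mathcal{Q}'_{s+1}$. Write $p = u - q^{(s+1)}$ for $u \in \mathcal{Q}'_s \cup \mathcal{Q}''_s$ and $q = v - q^{(s+1)}$
for $v \in (\mathcal{Q}_s \setminus \mathcal{Q}'_s) \cup \mathrm{Sh}_{h_{s+1}}(\mathcal{Q}_s \setminus \mathcal{Q}'_s)$. It remains to see that $u - v$ is not constant in $m$. If
$u \in \mathcal{Q}'_s$, then $u$ is constant in $m$, but none of the polynomials in $(\mathcal{Q}_s \setminus \mathcal{Q}'_s) \cup \mathrm{Sh}_{h_{s+1}}(\mathcal{Q}_s \setminus \mathcal{Q}'_s)$ are
constant in $m$. Suppose $u \in \mathcal{Q}''_s$. If $v \in \mathcal{Q}_s \setminus \mathcal{Q}'_s$ then $u - v$ is not constant in $m$ by induction
hypothesis. If $v \in \mathrm{Sh}_{h_{s+1}}(\mathcal{Q}_s \setminus \mathcal{Q}'_s)$, we write $v(m) = w(m + h_{s+1})$ for some $w \in \mathcal{Q}_s \setminus \mathcal{Q}'_s$. Then
$u(m) - w(m + h_{s+1})$ is not constant in $m$, since it is already not constant in $m$ upon setting
$h_{s+1} = 0$. This proves Claim 2. □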

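Claim 3. For any $0 \leq s \leq t$, the polynomials in $\mathcal{Q}''_s$ are not constant in $m$.

Proof of Claim 3. Indeed, if $s \geq 1$ then by definition we have

$$\mathcal{Q}''_s = (\mathcal{Q}''_{s-1} - q^{(s)}) \cup (\mathcal{Q}'_{s-1} - q^{(s)})$$

for some $q^{(s)} \in \mathcal{Q}_{s-1} \setminus \mathcal{Q}'_{s-1}$. Since the polynomials in $\mathcal{Q}'_{s-1}$ are constant in $m$ while $q^{(s)}$ is not, the polynomials in
$\mathcal{Q}'_{s-1} - q^{(s)}$ are not constant in $m$. From Claim 2 we know that the polynomials in $\mathcal{Q}''_{s-1} - q^{(s)}$
are not constant in $m$. □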

From Claim 1 we know that the polynomials in $\mathcal{Q}''_t \cup \mathcal{Q}'_t$ are distinct. The polynomials in
$\mathrm{Sh}_{h_{t+1}} \mathcal{Q}''_t$ have a new variable, namely $h_{t+1}$. From Claim 3 we know that they are not constant
in $h_{t+1}$, hence distinct from $\mathcal{Q}''_t \cup \mathcal{Q}'_t$.

It is also easy to see that the total degrees of the polynomials appearing in this process are
not increased, so they are always at most $d_0$. Also, all of their coefficients can be bounded
by a constant $C$ times the maximum of the absolute values of the coefficients of the original
polynomials (namely $P_i(Wm)/W$), where $C$ depends only on $P_1, \dots, P_k$. It follows that all
polynomial expressions in (15) satisfy the hypotheses of the extra condition. As in the earlier example,
expanding out the expression in (15) and using the extra condition to see that it equals $o(1)$
concludes the proof of Proposition 1. □

7. Concluding remarks 

Our proof actually gives a lower bound for the number of desired configurations. More
precisely, it shows that the number of pairs $(n,p) \in [N] \times [M]$ for which $n + P_1(p - 1), \dots, n + P_k(p - 1)$
and $p$ are all prime is at least $cNM/(\log N)^{k+1}$, as long as $M$ grows like a power
of $N$ that is at most $N^{1/2d_0}$. Just as in [11], this follows from the proof of Theorem 5 since
we can choose $w$ to be arbitrarily slowly growing (see also the more detailed discussion at
the start of [5, Section 11]). This is of the correct order of magnitude if one assumes the
Bateman-Horn conjecture.
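
For orientation, the order of magnitude can be recovered from a standard probabilistic heuristic. This is only a sketch: it assumes that an integer of size $x$ is prime with probability about $1/\log x$, treats the $k+1$ primality events as independent, and ignores the local (singular series) constant that the Bateman-Horn conjecture supplies. It also assumes that $d_0$ bounds the degrees of the $P_i$, so that each value $n + P_i(p-1)$ has size $O(N)$ when $M \leq N^{1/2d_0}$, and it uses $\log M \asymp \log N$ because $M$ is a power of $N$.

```latex
\#\bigl\{(n,p) \in [N] \times [M] :
      p,\ n + P_1(p-1), \dots, n + P_k(p-1) \text{ all prime}\bigr\}
  \;\approx\;
  \underbrace{\frac{M}{\log M}}_{\text{primes } p \le M}
  \cdot\, N \cdot
  \underbrace{\Bigl(\frac{1}{\log N}\Bigr)^{k}}_{k \text{ values of size } O(N)}
  \;\asymp\; \frac{NM}{(\log N)^{k+1}} .
```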

One can also see that Proposition 1 remains true if we apply it to functions $f_i$ satisfying
$|f_i| \leq 1$ instead of $|f_i| \leq \nu_1$. Indeed, under this condition, the proof of Proposition 1 is even
more straightforward: at each step, in place of the factor $\int_X \prod_{q \in \mathcal{Q}'} T^{q} \nu_1 = 1 + o(1)$, one
simply has a constant 1. Thus, taking $f_1 = \dots = f_k$ to be the characteristic function of a
set $A \subset [N]$ of density $\delta$ and using the uniform Bergelson-Leibman theorem [11, Theorem
3.2], one actually obtains a slightly different proof of Theorem 3. It is, of course, in the same
spirit as [3, Theorem 1.2], but has the advantage that it gives a lower bound on the number
of configurations. More precisely, we have the following result.

Proposition 4. Let $\eta > 0$. Suppose $N^{\eta} \leq M \leq N^{1/2d_0}$. Then for any $\delta > 0$, there is a
constant $c(\eta, \delta) > 0$ such that the following holds. Let $A$ be any subset of $[N]$ of density $\delta$.
Then $A$ contains at least $c(\eta, \delta) NM / \log M$ configurations of the form $a + P_1(p - 1), \dots, a + P_k(p - 1)$,
where $p \leq M$ is a prime.
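
The quantity counted in Proposition 4 is easy to compute by brute force for small parameters. The sketch below is illustrative only: the function names are hypothetical, the polynomials are passed as Python callables, and the base point $a$ is allowed to range over all integers (an assumption, since the statement only requires the configuration itself to lie in $A$).

```python
def primes_up_to(n):
    """Sieve of Eratosthenes: list of all primes <= n."""
    if n < 2:
        return []
    sieve = bytearray([1]) * (n + 1)
    sieve[0] = sieve[1] = 0
    for i in range(2, int(n ** 0.5) + 1):
        if sieve[i]:
            # Clear every multiple of i starting at i*i.
            sieve[i * i :: i] = bytearray(len(range(i * i, n + 1, i)))
    return [i for i in range(n + 1) if sieve[i]]


def count_configurations(A, polys, M):
    """Count pairs (a, p), p <= M prime, with a + P(p - 1) in A for every P in polys."""
    A = set(A)
    count = 0
    for p in primes_up_to(M):
        shifts = [P(p - 1) for P in polys]
        # a + shifts[0] must lie in A, so every candidate a is x - shifts[0], x in A.
        for x in A:
            a = x - shifts[0]
            if all(a + s in A for s in shifts):
                count += 1
    return count


# Example: A = even numbers in [1, 20], P_1(x) = x, P_2(x) = 2x, primes p <= 5.
evens = range(2, 21, 2)
print(count_configurations(evens, [lambda x: x, lambda x: 2 * x], 5))
# prints: 17
```

In this example the prime $p = 2$ contributes nothing, because $P_2(1) - P_1(1) = 1$ is odd, while $p = 3$ and $p = 5$ contribute 9 and 8 configurations respectively.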

This bound does not follow from [3]. On the other hand, the proof of Frantzikinakis, Host
and Kra shows that if $A \subset \mathbb{Z}$ and $d(A) > 0$, then

$$d\big(A \cap (A - P_1(p - 1)) \cap \dots \cap (A - P_k(p - 1))\big) > 0$$

for $p$ in a set of positive relative density in the primes.

Appendix: The generalized correlation estimate 

In this appendix we point out the modifications that need to be made to the proof of [11,
Proposition 10.1] to obtain Proposition 3 (correcting some misprints from [11] in the process).
Recall that we had ui = vwM and V2 = and two disjoint indexing sets Ji, J2 C N, 

$J = J_1 \cup J_2$. For $j \in J$, we have polynomials $P_j \in \mathbb{Z}[x_1, \dots, x_D]$ of degree at most $d$. The
convex body $\Omega \subset \mathbb{R}^D$ was assumed to have inradius at least $R^{4|J|+1}$. We denoted by $\mathcal{P}_b$ the
set of primes $w \leq p \leq R^{\log R}$ which are bad with respect to $((WP_j + b_1)_{j \in J_1}, (WP_j + b_2)_{j \in J_2})$,
and assumed that there are no terrible primes in the same range.
Proof of Proposition 3. We wish to estimate

$$\mathbb{E}_{x \in \Omega \cap \mathbb{Z}^D} \prod_{j \in J_1} \nu_1(P_j(x)) \prod_{j \in J_2} \nu_2(P_j(x)).$$

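Expanding this out in terms of the definitions of $\nu_1, \nu_2$, we find that it equals

$$\Big(\frac{\phi(W)}{W} \log R\Big)^{|J|} \sum_{\substack{m_j, m'_j \geq 1 \\ j \in J}} \Big( \prod_{j \in J} \mu(m_j) \mu(m'_j) \chi\Big(\frac{\log m_j}{\log R}\Big) \chi\Big(\frac{\log m'_j}{\log R}\Big) \Big) \, \mathbb{E}_{x \in \Omega \cap \mathbb{Z}^D} \prod_{i=1,2} \prod_{j \in J_i} 1_{\mathrm{lcm}(m_j, m'_j) \mid WP_j(x) + b_i}, \tag{16}$$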

where $\mathrm{lcm}(a, b)$ denotes the least common multiple of two integers $a$ and $b$. Setting $M =
\mathrm{lcm}((m_j)_{j \in J}, (m'_j)_{j \in J})$, we observe that $M$ can be assumed to be square-free (due to the presence
of the Möbius function) and of size at most $R^{2|J|}$ (due to the restrictions on each $m_j, m'_j$
imposed by the cutoff $\chi$). Each function $x \mapsto 1_{\mathrm{lcm}(m_j, m'_j) \mid WP_j(x) + b_1}$, $x \mapsto 1_{\mathrm{lcm}(m_j, m'_j) \mid WP_j(x) + b_2}$
is periodic with respect to $M \cdot \mathbb{Z}^D$, and can therefore be defined on $\mathbb{Z}_M^D$. By [11, Corollary
C.3], we have



lcm(mj ,mj )\WPj {x)+bi 



Ij IX IX \cm{mj,m'j)\WPj{y)+bi^ 

i=i,2jeJi 

where the O error term is easily seen to result in an additive o(l) error, and will therefore be 
negligible. Setting 

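$$\alpha\big((\mathrm{lcm}(m_j, m'_j))_{j \in J}\big) := \mathbb{E}_{y \in \mathbb{Z}_M^D} \prod_{i=1,2} \prod_{j \in J_i} 1_{\mathrm{lcm}(m_j, m'_j) \mid WP_j(y) + b_i},$$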

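it therefore suffices to show that

$$\Big(\frac{\phi(W)}{W} \log R\Big)^{|J|} \sum_{\substack{m_j, m'_j \geq 1 \\ j \in J}} \Big( \prod_{j \in J} \mu(m_j) \mu(m'_j) \chi\Big(\frac{\log m_j}{\log R}\Big) \chi\Big(\frac{\log m'_j}{\log R}\Big) \Big) \, \alpha\big((\mathrm{lcm}(m_j, m'_j))_{j \in J}\big) = 1 + o_{D,J,d}(1) + O_{D,J,d}\Big(\mathrm{Exp}\Big(O_{D,J,d}\Big(\sum_{p \in \mathcal{P}_b} \frac{1}{p}\Big)\Big)\Big). \tag{17}$$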

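By the Chinese remainder theorem $\alpha$ is multiplicative in the sense that if $\mathrm{lcm}(m_j, m'_j) =
\prod_p p^{r_j(p)}$ then

$$\alpha\big((\mathrm{lcm}(m_j, m'_j))_{j \in J}\big) = \prod_p \alpha\big((p^{r_j(p)})_{j \in J}\big),$$

where the latter is a finite product. Since the $m_j$ are assumed to be squarefree, $r_j(p)$ is either
0 or 1 for each $j$ and each $p$, and we obtain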

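$$\alpha\big((\mathrm{lcm}(m_j, m'_j))_{j \in J}\big) = \prod_p c_p\big((WP_j + b_1)_{j \in J_1,\, r_j(p) = 1}, (WP_j + b_2)_{j \in J_2,\, r_j(p) = 1}\big),$$

where the local factor $c_p(P_1, \dots, P_k)$ is defined by

$$c_p(P_1, \dots, P_k) := \mathbb{E}_{y \in \mathbb{F}_p^D} \prod_{j=1}^{k} 1_{P_j(y) \equiv 0 \,(p)}.$$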
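
The multiplicativity supplied by the Chinese remainder theorem can be verified by brute force in a small case. The sketch below is illustrative only: `alpha` and `local_factor` are hypothetical helper names, and the two polynomials `Q1`, `Q2` are arbitrary stand-ins for the forms $WP_j + b_i$ in $D = 2$ variables.

```python
from fractions import Fraction
from functools import reduce
from itertools import product
from math import gcd


def lcm_all(ns):
    """Least common multiple of a list of positive integers (1 for an empty list)."""
    return reduce(lambda a, b: a * b // gcd(a, b), ns, 1)


def alpha(polys, moduli, D):
    """Density of y in (Z/M)^D with moduli[j] | polys[j](y) for every j, M = lcm(moduli)."""
    M = lcm_all(moduli)
    hits = sum(
        all(Q(*y) % n == 0 for Q, n in zip(polys, moduli))
        for y in product(range(M), repeat=D)
    )
    return Fraction(hits, M ** D)


def local_factor(polys, p, D):
    """c_p(polys): density of common zeros modulo the prime p (1 for an empty family)."""
    return alpha(polys, [p] * len(polys), D)


# Hypothetical stand-ins for the forms W*P_j + b_i, in D = 2 variables.
Q1 = lambda x, y: x * x + y
Q2 = lambda x, y: x + 2 * y + 1

lhs = alpha([Q1, Q2], [6, 15], D=2)        # moduli 6 = 2*3 and 15 = 3*5
rhs = (local_factor([Q1], 2, 2)            # only Q1 sees the prime 2
       * local_factor([Q1, Q2], 3, 2)      # both polynomials see the prime 3
       * local_factor([Q2], 5, 2))         # only Q2 sees the prime 5
print(lhs == rhs, lhs)
# prints: True 1/90
```

Here the three local factors are $1/2$, $1/9$ and $1/5$, and their product agrees with the density computed directly modulo $30$.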

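So the left-hand side of (17) becomes

$$\Big(\frac{\phi(W)}{W} \log R\Big)^{|J|} \sum_{\substack{m_j, m'_j \geq 1 \\ j \in J}} \Big( \prod_{j \in J} \mu(m_j) \mu(m'_j) \chi\Big(\frac{\log m_j}{\log R}\Big) \chi\Big(\frac{\log m'_j}{\log R}\Big) \Big) \prod_{p \leq R^{\log R}} c_p\big((WP_j + b_1)_{j \in J_1,\, r_j(p) = 1}, (WP_j + b_2)_{j \in J_2,\, r_j(p) = 1}\big), \tag{18}$$

where we were able to restrict the product to primes less than $R^{\log R}$ because each $m_j$ is
bounded by $R$.
We can now replace $\chi$ by terms which are multiplicative in $m_j, m'_j$, using the Fourier
expansion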

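$$\chi(x) = e^{-x} \int_{-\infty}^{\infty} \phi(\xi) e^{-i x \xi} \, d\xi$$

for a smooth and rapidly decaying function $\phi$. We have

$$\chi\Big(\frac{\log m_j}{\log R}\Big) = \int_{-\infty}^{\infty} m_j^{-z_j} \phi(\xi_j) \, d\xi_j, \qquad z_j := \frac{1 + i\xi_j}{\log R}.$$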

Setting up the corresponding notation involving $m'_j$, $z'_j$ and $\xi'_j$, (18) becomes
(19) 

\ogR] 5] r ■■■ r in f^irnM^)rnj''^7^''m'Pi^j)dCjdC^ 



p<^logfl 



which can be rewritten in the form 



(^logR) r...r n Ej]j<p{c,m'^)d^jdeA, 



p<RiosR Vie J 

where 

jeJ mj,m'.€{l,p} \jeJ / 

One now approximates the Euler factor Ep by 

using a series of claims for different types of primes p, whose proofs we shall postpone until 
the end of the section. 



Claim 4 (Small primes). 

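Claim 5 (Bad but not terrible primes).

$$\prod_{\substack{w \leq p \leq R^{\log R} \\ p \text{ bad but not terrible}}} \frac{E_p}{E'_p} = 1 + O\Big(\mathrm{Exp}\Big(O\Big(\sum_{p \in \mathcal{P}_b} p^{-1}\Big)\Big)\Big)$$

Claim 6 (Good primes).

$$\prod_{\substack{w \leq p \leq R^{\log R} \\ p \text{ good}}} \frac{E_p}{E'_p} = 1 + o(1)$$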



Together Claims 4, 5 and 6 imply that if there are no terrible primes $p \geq w$, then

\J\ 




w<p<R^° 

The proof of Proposition 3 is now completed, exactly as in [11, Proposition 10.1], using some
elementary theory of the Riemann $\zeta$ function, as well as the rapid decay of $\phi$. □

Finally, for completeness, we give the proofs of Claims 4, 5 and 6. They rely on the
rather elementary [11, Lemma 9.5], which itself is proved with the help of a combinatorial
Nullstellensatz [11, Appendix D].

Proof of Claim 4. If $p < w$, then for all $j \in J_i$ we have $WP_j + b_i \equiv b_i \not\equiv 0 \,(p)$, so for such $p$
the local factor $c_p\big((WP_j + b_1)_{j \in J_1,\, r_j(p) = 1}, (WP_j + b_2)_{j \in J_2,\, r_j(p) = 1}\big)$ is equal to 0 unless the set
$\{j \in J : r_j(p) = 1\}$ is the empty set, in which case it equals 1. The latter case happens
if and only if all $m_j, m'_j$ are equal to 1, so that $E_p = 1$. A direct computation, using the fact
that $w$ goes to infinity much more slowly than $R$, shows that $E'_p = (1 - p^{-1})^{|J|} + o(1)$. The
estimate for the product over all primes $p < w$ of $E_p / E'_p$ then follows from the fact that

Proof of Claim 5. If $p \geq w$ is bad but not terrible, then [11, Lemma 9.5 (b)] implies that
the local factor $c_p$ of any non-trivial family is $O(p^{-1})$. Hence the sum defining $E_p$ has a
contribution of 1 from $m_j = m'_j = 1$ for all $j \in J$, and a contribution of $O(p^{-1})$ from all other
terms, so that $E_p = 1 + O(p^{-1})$. Also, we find by Taylor expanding that

(20) ^;=i--E f^+4V-E^+o(4) = i+o(- 

where the last inequality follows from the fact that $z_j, z'_j$ have real part $1/\log R > 0$. It
follows that

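$$\prod_{p \text{ bad but not terrible}} \frac{E_p}{E'_p} = \prod_{p \text{ bad but not terrible}} \Big(1 + O\Big(\frac{1}{p}\Big)\Big) = \prod_{p \text{ bad but not terrible}} \exp\Big(O\Big(\frac{1}{p}\Big)\Big),$$

which equals

$$\exp\Big(O\Big(\sum_{p \in \mathcal{P}_b} \frac{1}{p}\Big)\Big) = 1 + O\Big(\mathrm{Exp}\Big(O\Big(\sum_{p \in \mathcal{P}_b} \frac{1}{p}\Big)\Big)\Big).$$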

□

Proof of Claim 6. If $p \geq w$ is good, then by [11, Lemma 9.5 (c)] the local factor $c_p$ of a
non-trivial family consisting of precisely 1 polynomial is p^^ + 0{p^^). Such a non-trivial 
family arises in the case when $m_j = 1, m'_j = p$ or $m_j = p, m'_j = 1$ or $m_j = p, m'_j = p$ for
exactly one j € [J]. In all remaining cases, Cp = 0{p^'^) by [11, Lemma 9.5 (d)]. This implies 
that 




The statement in Claim [6] now follows from the fact that the product Yipi^ + 0{p ^)) is 
convergent and w tends to infinity. □ 
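
The final step can be made quantitative. As a sketch, suppose each factor has the form $1 + a_p$ with $|a_p| \leq C p^{-\sigma}$ for some constants $C > 0$ and $\sigma > 1$; the symbols $a_p$, $C$ and $\sigma$ are introduced here only for illustration and are not notation from the proof. Then the tail of the product over primes $p \geq w$ is controlled as follows.

```latex
\Bigl| \prod_{p \ge w} (1 + a_p) - 1 \Bigr|
  \;\le\; \exp\Bigl( \sum_{p \ge w} |a_p| \Bigr) - 1
  \;\le\; \exp\Bigl( C \sum_{n \ge w} n^{-\sigma} \Bigr) - 1
  \;\le\; \exp\Bigl( C \Bigl( w^{-\sigma} + \frac{w^{1-\sigma}}{\sigma - 1} \Bigr) \Bigr) - 1
  \;\longrightarrow\; 0 \qquad (w \to \infty).
```

The last inequality compares the sum over integers $n \geq w$ with the integral $\int_w^\infty t^{-\sigma} \, dt$.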